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Negative-strand RNA viruses: Genetic engineering 
and applications 

Peter Palese*, Hongyong Zheng, Othmar G. Engelhardt, Stephan Pleschka, and Adolfo Garci'a-Sastre 

Department of Microbiology. Mount Sfnai School of Medicine. 1 Gustavc L Levy Place, New York. NY 10(129 



ABSTRACT Tkt nepttve-sti-andRNAvunisesarcabn^ 
group of animal viruses that comprise several important 
human pathogens, including Influenza, measles, mumps, ra- 
bies, respiratory syncytial, Ebola, and hantaviruses, The 
development of new strategics to genetically manipulate the 
genomes of negative-strand RNA viruses has provided us with 
new toots to study the structure-Junction relationships of the 
viral components and their contributions to the pathogenicity 
of these viruses. It is also now possible to envision rational 
appro ac hes based on genetic engineering techniques — to 
design live attenuated vaccines against some of these viral 
agents. In addition, the use of different negstive-strand RNA 
viruses as vectors to efficiently express foreign polypeptides 
has also become feasible, and these novel vectors have poten- 
tial applications In disease prevention as well as in gene 
therapy. 



DNA-Containing Viruses 

Among animal viruses, DNA-containing viruses were the first 
to become amenable to genetic engineering techniques. This 
breakthrough was achieved for simian virus 40 when a cloned 
cDNA copy was transfected into cells, resulting in the forma- 
tion of infectious virus (see Tabic I). Transfected mutated 
cDNA molecules gave rise to defined mutant viruses (I). A 
second methodology involving the use of homologous recom- 
bination allowed, for the first time, the rescue of large DNA- 
containing viruses such as herpes viruses (2). In this approach, 
intact herpes viral DNA as well as cloned DNA flanked by viral 
sequences was transfected into cells. Homologous recombina- 
tion between the cloned DNA and the wild-type genome can 
occur, and novel viruses can be selected under appropriate 
conditions. For example, recombinants with DNA fragments 
containing a viral thymidine kinase gene can be selected in 
appropriate cell lines and media, and viruses lacking a thymi- 
dine kinase can be isolated in the presence of nucleoside 
analogs (e.g.. Ara T). This general technique allows the suc- 
cessful construction of viral variants of herpes viruses, and 
similar procedures have been developed for pox viruses (3, 4) 
and other DNA-containing viruses including adenoviruses (5) 
and parvoviruses (6). Finally, strategies have been developed 
to generate infectious as well as mutant viruses by transfecting 
cosmids containing overlapping portions of large viral ge- 
nomes. Viruses arise via recombinati n between the cosmids. 
This system was successfully used to rescue infectious herpes 
simplex I viruses (7), cytomegaloviruses (8) and Epstein-Barr 
viruses (9) from their respective cosmids. 



The publication costs of this article were defrayed in part by page charge 
payment. This article must therefore be hereby marked -adwrtisetnent" in 
accordance with 18 U.S.C. §1734 solely lo indicate this fact 



Positive-Strand RNA Viruses 

RNA-containing viruses belong to a variety of families with 
diverse replication strategies. Unique among the RNA viruses 
are the retroviruses, whose replication involves a double- 
stranded DNA phase, making these viruses an easy target for 
genetic manipulation. Transfection of full-length cDNA mol- 
ecules leads to the establishment of replicating virus particles 
and integration of the viral genetic information into the host 
genome (10). The engineering of retroviral genomes has 
become one of the most successful genetic approaches in 
modern virology and is central to the study both of viral gene 
expression and of protein structure-function analysis. In ad- 
dition, retrovirus constructs are among the most widely used 
vectors for gene transfer and gene therapy (II). 

Most of the other positive-strand RNA viruses are also 
amenable to genetic engineering approaches (Table 1 ). In the 
case of the small and medium sized positive-strand RNA 
viruses, full-length genomic RNA has been shown to be 
infectious when transfected into ceils. Plus-strand RNA serves 
as mRNA for the synthesis of viral proteins as well as template 
for viral RNA replication. Thus, transfection of cloned DNA 
of poliovirus RNA (or of cDNA-derivcd RNA) into permissive 
cells results in the formation of infectious virus particles (12). 

Remarkably successful have been studies using Sindbis viruses 
and Semliki forest virus (13, 14). The cDNA-dcrived RNAs of 
these positive-strand RNA viruses can be used to efficiently 
rescue infectious viruses, thus allowing an extensive analysis of the 
promoter elements of the viral RNAs as wcli as structure- 
function studies of the viral proteins. Furthermore, these viruses 
have received increased attention because of their potential for 
expressing copious amounts of heterologous genes via recombi- 
nant constructs. Up to 10* molecules of heterologous protein per 
cell have been expressed using these systems.* 

Introduction of cDNA-Derived RNA into a Negative-Strand 
RNA Virus (Influenza Virus) 

The life cycle of negative-strand RNA viruses differs from that 
of the other RNA viruses in many ways. Specifically, the 
genomic RNA of negative-strand RNA viruses is not infec- 
tious, and infectious virus particles must also deliver their own 
RNA-dcpendent RNA polymerase into the infected cell to 
start the first round of virus-specific mRNA synthesis. 

Thus, approaches different from those used for positive- 
strand RNA viruses had to be developed to allow the rescue of 



Abbreviations: RNP, rihonuclcoprotcin; HA. . . mugglutinin; NA. 
neuraminidase. VSV, vesicular stomatitis virus. 
•To whom reprint requests should be addressed, e-mail: ppalescC" 
smtplink.mssm.edu. 
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Tabic 1. Genetic engineering of animal viruses 



Type of genome 



Prototype viruses 



dsDNA 



ssDNA 
ssRNA 



Simian virus 40, herpes, 
adenovirus, poxvirus 



Strategics 



Transection of cDNA; homologous 
rccomhination using cloned DNA and intact 
viral DNA or helper viruses; transfcction of 
cosmids containing viral genes 
Transection of plasmids containing AAV genes 
Transfcction of infectious cDNA 



Adcno-associated virus (AAV) 
Retrovirus 

Plus-sense RNA 

Picornavirus. Scmliki forest Transfcction of cDNA-dcrivcd infectious RNA 
vims, Sindbh virus 

Minus-sense RNA 



Influenza virus, rhabdovirus, 
parainfluenza virus, 
hunyavirus 



dsRNA 



Transfcction of reconstituted ribonuctcoprotcin 
in the presence of helper virus; rescue of 
virus from cDNA clones transcribed in viim 
or in vivo in the presence of helper virus or 
of viral polymerase proteins expressed 
intraccllularty in trans 



ds. Double stranded: ss, single stranded. 

genetically engineered viruses of these virus families (Table I ). 
Sitc-spcrificafiy altered influenza viruses were first obtained 
by reconstituting /n vitro a biologically active ribonudcoprotein 
complex (made of synthetic RNA and purified nucJcoprotetn 
and polymerase proteins) and then transferring the complex 
into helper virus-infected cells (Fig. ]) (15). The helper virus 
provides in trans the viral proteins required for amplification 
of the synthetic RNP complex. Subsequent reassortmcnt of the 
synthetic gene and helper virus-derived RNA segments, fol- 
lowed by selection for the reassortant (transfectant) virus, 
allows the introduction of site-specific changes into the ge- 
nome of influenza viruses (16). Selection of the transfectant 
virus can be achieved by choosing host range or temperature- 
sensitive mutants as helper, viruses. Alternatively, antibody 
preparations specific for the viral surface proteins can be used 
to select against the helper virus or for these novel viral 
constructs. Following such protocols, six of the eight genes 
(PB2, hemagglutinin (HA), neuraminidase (NA), NP, M and 
NS] of influenza A viruses and the HA of an influenza B virus 
have now successfully been altered by genetic engineering 
methods (17-22). 

Plasmid- Based Reverse Genetics System 
for Influenza Virus 

A method was recently developed to reconstitute a biologically 
active influenza virus KNf complex within a ceti rather than 
in vino. This alternative approach avoids the need to purify 
viral proteins and to transfect an RNA-protein complex into 
cells; instead, this method involves the transfection of plas- 
mids. The first plasmid contains a human polymerase I pro- 
moter and a hepatitis delta virus-derived ribozyme sequence 
which flank the synthetic influenza virus gene. The polymerase 
I-driven plasmid is cotransfected into human cells with poly- 
merase II-responsive plasmids expressing in trans the viral 
PB1, PB2, PA, and NP proteins. Such a system involving the 
use of five plasmids allows the amplification and expression of 
a synthetic influenza virus gene and takes advantage of the 
convenience of plasmid transfections as compared with RNP 
transections (23). Using this approach, it was possible t 
rescu a synthetic NA gene into a recombinant influenza A 
virus. A synthetic HA gene has also been rescued by this novel 
technique (Fig. 2) (A.G.-S., unpublished results). It should be 
noted, however, that this plasmid-based reverse genetics sys- 
tem still relies n the presence of a helper virus which provides 
the genetic backbone int which the plasmid-derived gene can 
be introduced. 



Chimeric Influenza Viruses Expressing Foreign 
Epitopes or Polypeptides 

The development of methods to rescue synthetic RNAs into 
the genomes of influenza viruses allowed the construction of 
chimeric viruses expressing a variety of foreign epit o pe s . 
Specifically, epitopes derived from HIV, rrtasmodia. or lym- 
phocytic choriomeningitis virus proteins were successfully 
expressed in cither the HA or the NA of different influenza 
viruses (16, 24). Such constructs were shown to induce a potent 
B-cell and/or T-ccll response against the foreign epitope in 
experimental animal systems. Specifically, Li el aL (25) gen- 

RE&CUg OF INFECTIOUS HICT MCMT^ Vff|ffiff| 




transfectant 
virus 



Fig. I. A reverse genetics system far the rescue of infectious 
influenza viruses containing cDNA-dcrivcd RNA. The method allows 
the substitution of one of the eight genomic RNA segments of the virus 
by a synthetic RNA. A biologically active viral ribonucleoproictn 
complex (RNP) is made in \ittv by mixing cDNA-dcrivetl RNA with 
purified viral nuclcoprotcin and polymerase proteins. The RNPs arc 
transfected into cells which have been previously infected with an 
influenza helper virus. Using a selection method, viruses containing 
the genetically engineered RNP (transfectant viruses) can he isolated. 
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PI ASMIP-BASgD REVERSE GENETICS 
SVSTFM FOR INFLUENZA VIRUS 



PB1 




HA 



+ 

HELPER VIRUS 



TRANSFECT ANT VIRUS 

Fig. 2. A plasmid-bascd reverse genetics system for the rescue of 
infectious influenza viruses containing a genetically engineered seg- 
ment. Cells arc transfected with four plasmids that are able to express 
the viral NP and polymerase (PB2, PB1, and PA) proteins from a 
cellular polymerase ll-rcsponsivc promoter (pol II). An additional 
plasmid which contains, for example, the HA open reading frame 
flanked by the 5' and 3' noncoding regions of the viral RNA segment 
(black boxes) is cotransfected. The HA plasmid is able to express an 
H A-spccific viral RNA by transcription from a polymerase I -respon- 
sive promoter (pol I) followed by the rihozymc (RZ)-mediatcd 
cleavage of the transcript. The H A-specific RNA segment is intracel- 
lular!)* complcxed with the NP and polymerase proteins to form RNPs 
that can be rescued into a transfectant virus if the cells are also infected 
with an influenza helper virus. Selection of the transfectant viruses can 
be performed by using neutralizing antibodies against the HA protein 
of the helper virus. 

crated a recombinant influenza virus that expressed a CD8 * 
T-cell epitope derived from the circumsporozoitc (CS) protein 
of Plasmodium yoelii in its HA. Mice immunized with this 
transfectant virus made a vigorous cytotoxic T lymphocyte 
response against this epitope (25). By boosting mice with a 
recombinant vaccinia virus expressing the CS protein, it was 
possible to achieve protective immunity (60%) against chal- 
lenge with live P. yoelii sporozoites. Additional protective im- 
mune responses were generated by immunizing mice with trans- 
fectants expressing B-cell-speciftc epitopes located in the repeat 
region of the CS protein of P. yoelii Up to 80% of immunized 
mice were immune to challenge with one hundred P. yoelii 
sporozoites (26). 

Foreign epitopes can be inserted into several sites on the HA 
molecule of influenza viruses, and most conveniently into the 
stalk region of iuc NA. in fact, stretches of more than Ktl 
foreign amino acids have been successfully inserted into the 
stalk region of the NA (27, 28) (S. Itamura, personal commu- 
nication). Although some of these constructs show interesting 
biological properties, this approach of epitope grafting has its 
limitations in terms of the size and the nature of the epitope 
that can be expressed (since the chimeric protein may affect 
the viability of the recombinant virus). 

A generic approach to the expression of foreign proteins is 
the construction of bicistronic genes which can be packaged 
into infectious particles. The foreign gene can replace the open 
reading frame of one f the influenza virus genes and the 
respective influenza virus protein is then translated from an 
internal ribosome entry site (IRES element) on the genetically 
engineered gene. Alternatively, the foreign protein can be trans- 
lated from an internal IRES sequence. Express! n f several 
f reign polypeptides was achieved in this way (16, 29). However, 
many constructs did not result in viable viruses (unpublished 
results). Attempts are currently being made to identify the factors 
which determine the limitati ns of this approach. 



The second method for the cxpressi n of foreign proteins 
takes advantage of autoproteolytic elements placed within a 
fusion protein. For example, a virus was constructed that 
expresses a fusion protein c nsisting of the full-length chlor- 
amphenicol acetyltransferase (CAT) protein, the 2A protease 
of foot and mouth di&ease virus, and the viral NA (30). This 
virus was stably passaged and expressed copious amounts of 
CAT protein in infected cells. However, in all cases of the 
fusion protein constructs, the foreign protein contains a 16- 
amino acid extension derived from the 2A protease which may 
alter the biological properties of the foreign protein. 

Rescue of Infectious Rabies Virus rrom cDNA 

Like the segmented negative-strand RNA viruses, the Monon- 
egavirales group packages its own RNA-depcndcnt RNA 
polymerase into virus particles to initiate viral RNA synthesis. 
Thus, naked RNA alone is unable to drive the replication cycl . 
Several approaches were taken to rescue model and full-length 
RNAs. First, a Scndai virus-like RNA transcript was amplified 
and expressed by transfecting the naked model RNA into 
Scndai virus-infected cells (31 ). This experiment suggests that 
complementation in traas by the viral polymerase complex is 
required for the amplification and expression of the viral 
RNA-likc reporter gene. Subsequently, in a remarkable study. 
Schncll et al (32) succeeded in constructing a plasmid that 
expresses a full-length rabies virus RNA transcript from a 17 
RNA polymerase promoter. The plasmid DNA containing this 
viral insert was transfected into cells infected with a recom- 
binant vaccinia virus expressing the 17 polymerase. Three 
other plasmids expressing the rabies virus N, P and I- proteins 
were also cotransfected into these cells. In this recombinant 
vaccinia virus-driven system, the presence of the viral poly- 
merase complex and of a full-length viral RNA (in plus sense) 
led to the formation of recombinant rabies virus. 

This system has been elegantly exploited to study the 
promoter elements of rabies virus RNA and to elucidate the 
interaction of this interesting virus with cells (33). Surprisingly, 
cells infected with a mutant lacking the virus* only glycoprotein 
(G) were still able to hud from the cell surface, albeit at a 
30-fold lower efficiency (34). This experiment revealed that the 
surface protein G exhibits an intrinsic exocytotic activity. The 
system was further developed to show that a hybrid G/HI V-l 
glycoprotein was able to form pseudotypes with the "G-lcss" 
particle, thus changing the host range by restricting infection 
to CD4* cells. This experiment clearly demonstrates thai 
genetic engineering can redirect the host range and cell 
tropism of rabies viruses. This should prove helpful for the 
development of novel vaccines as well as for gene therapy. 

Rescue of Other Nonsegmented Negative-Strand 
RNA Viruses 

An effective DNA transfection system has also been developed 
for another rhabdovirus. vesicular stomatitis virus ( VSV) (35. 
36) (Fig. 3). Again, the polymerase complex (N. P. and L 
proteins) was expressed in cells from plasmids transcribed by 
a 17 RNA polymcrasc-containing vaccinia virus recombinant. 
Recombinant VSVs expressing an additional transcriptional 
unit were rescued and high-expression levels of heterologous 
proteins were achieved (37). In a dramatic experiment, the 
auth rs were able to construct a recombinant VSV expressing 
the CD4 protein. This protein was pack at ! at levels of up to 
30% of the G protein itself, and the recombinant particle had 
an 18% greater length than wild-type virus due to the extra 
gene. These results illustrate that VSV is an effective vect r t 
express foreign proteins at high levels, and that the virus is 
tolerant to the inscrti n of novel transcriptional units. Reverse 
genetics systems have also been developed for paramyxovi- 
ruses. In the case f measles virus, a cell line constituttverv 
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(Vaccina virus/Call llnas) 

T7 POLYMERASE 
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1. Plasmids expressing 
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INFECTIOUS 
VIRUS 

Fici. 3. Reverse genetics systems for the rescue of infectious 
nonsegmcnted negaiive-stnind RNA viruses from cDNA. Transcrip- 
tionally competent viral RNPs are made by cellular expressiofl of the 
viral proteins N. P. and L. This can be achieved by a variety of methods, 
including vaccinia vii its-driven expression and/or complementing cell 
lines constituiively expressing T7 polymerase and viral proteins. The 
full-length viral RNA can he provided hy transfcrting plasmids ex- 
pressing antjgenomic or genomic RNA or by directly transfecting 
naked RNA (plus-sense or minus-sense). The' intracellular^ assem- 
bled RNPs arc transcribed and replicated hy the viral polymerase 
complex |N. P. and L proteins) generating infectious viruses. 

expressing 17 polymerase and the measles N and P proteins has 
been used for the rescue of infectious virus from full-length clones 
(38) and vaccinia virus-based systems have allowed the rescue of 
respiratory syncytial virus (39) and of Scndai viruses (40. 41). 

Most of the earlier systems developed for the nonsegmcnted 
viruses used the intracellular expression of antigenomtc plus- 
sense RNA as the template to initiate the replication cycle. 
Either the plus-sense RNA was transcribed by the 17 poly- 
merase expressed by a vaccinia-recomhinant virus (32, 35. 36. 
39-41)* or transcription was driven by the 17 poly me rase 
which was permanently expressed in cells (38). Recently, an 
important series of experiments showed that intracellular 
expression of a full-length transcript generated infectious 
Sendai virus regardless of whether the plus-sense or the 
minus-sense RNA was transcribed (41). Success appears to 
have come from fine tuning the system in terms of the 
c nccntration of the polymerase components (N. P. and L 
proteins) and from constructing plasmids giving rise to tran- 
scripts with 5' and 3' ends identical to those of the wild-type 
RNA. Optimization of the system also involved the use of the 
vaccinia virus inhibitors, cytosinc arabinosidc and rifampicin. 
These compounds reduced the cytotoxicity of vaccinia virus 
and resulted in a dramatic increase of the expression levels of 
a Sendai virus RNA-like reporter gene. Most interesting was 
the finding that recovery of infectious Sendai virus was also 
possible by transfecting naked RNA. The efficiency of recov- 
ery appeared to be lower using plus-sense RNA than the 
genomic minus-sense RNA (41). The latter results involving 
the use f naked RNAs extend the earlier findings that 
transfecti n of naked model RNAs alone results in the effi- 
cient amplification and expression of these minigencs in cells 
infected with Sendai virus (3 1 ). rcspirat ry syncytial virus (42) 
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or parainfluenza virus 3 (43, 44). In the future, improvements 
in the transfection systems to generate novel viruses with case 
will provide us with even better tools for the studv of negative- 
strand RNA viruses. 

Perspective 

The ability to genetically alter negative-strand RNA viruses 
has already enhanced this field of virology and may have a 
major influence on future developments in vaccines, gene 
therapy, cancer treatment, and manufacture of biologicals. 
First, structure-function studies of individual viral genes arc 
now possible in the context of an infectious virus for a number 
of negative-strand RNA virus families. These groups consist of 
many medically important viruses including measles, mumps, 
respiratory syncytial, parainfluenza, influenza, and hunvavi- 
ruses. In the recent past we tried to take a reductionist 
approach in virology; viral genes were studied in isolation by 
cloning and expressing them in different systems. The pendu- 
lum has now swung back in the other direction a? we ask 
questions about how viral genes and gene products interact 
with host cell components and the host in general. This can 
best be done by studying genetically defined viruses and 
subjecting them to directed mutational analysts. These viral 
constructs arc then available for biochemical analysis as well as 
for studying replication and growth in tissue culture or exper- 
imental animals. Obviously, structure-function studies of viral 
genes also include the analysis of promoter elements and other 
noncoding sequences. 

Second, genetically engineered negative-strand RNA vi- 
ruses should become candidates for use as live virus vaccines. 
Genetically engineered influenza viruses with changes in 
coding or noncoding sequences may induce immune responses 
which arc longer-lasting and more protective than those gen- 
erated by conventional influenza virus vaccines. In the case of 
respiratory syncytial and parainfluenza viruses, a recombinant 
DNA approach may be the only rational strategy, since the 
Jennerian approach has not resulted in acceptable vaccine 
candidates. Thus, tools are now available to design a new 
generation of vaccines for the medically important negative- 
strand RNA viruses. 

Third, negative-strand RNA viruses may become useful 
vectors for the expression of foreign genes. Recombinant 
influenza viruses ( 16), rabies viruses (45). and VSV (37) have 
been used to express additional protein sequences or foreign 
genes. Packaging limitations and restrictions due to the length 
or the nature of the foreign gene are not yet defined for 
negative-strand RNA virus constructs, nor do we have suffi- 
cient information about the stability of these viruses once their 
genome structures have been extensively altered. These un- 
certainties notwithstanding, there is a major advantage in the 
use of negative strand RNA viruses as vectors (or as vaccines). 
These viruses do not go through a DNA phase and thus cannot 
transform cells hy integrating their genetic information into 
the host cell genome. Furthermore, homologous recombina- 
tion has never been observed for any of the ncgativc-sirand 
RNA viruses. Thus, replication-incompetent viral constructs 
grown in complementing cell lines should be free of contam- 
inating virus generated by a recnmbinational event. In terms 
of safety, these properties weigh heavily in favor of negative- 
strand RNA virus vectors. 

Novel viruses expressing foreign genes may serve prophy- 
lactically as vaccines, or they may play a role in gene therapy 
when a transient expression would he beneficial. The latter 
may be the case in cancer therapy, which could require a 
targeted infection followed by the induction of a lethal cyto- 
pathic effect. Repeated administration of negative-strand 
RNA viruses may not be feasible in this situation because of 
the host's immune response. However, the choice of different 
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antigenic variants (us is possible with influenza viruses) may 
overcome this limitation. 

Finally, the highly efficient expression of viral and foreign 
proteins via negative-strand RNA virus vect rs may have 
additional bi technological applications. It is possible that 
defective RNA constructs could be used for genetic immuni- 
zation. This form of vaccination would resemble DNA vacci- 
nation (46) in that the defective particle would go through 
many replication rounds and persist without spreading to 
neighboring cells. Such RNA replicons may have interesting 
biological properties since the efficiency of infection should be 
comparable to that of whole viruses. Also, replication com- 
petent viral vectors may help in the manufacture of large 
quantities of biological reagents, since the quantities expressed 
by negative-sense RNA viruses can be high. It is also possible 
that purification of expressed proteins could be made easier if 
they were incorporated into extracellular virus particles. 

The solutions to many of the issues discussed here will 
depend on the continuing success of basic science and the 
development of novel strategies to study viruses. Our horizons 
must expand and include the analysis not only of the viruses but 
also of their interactions with the host cell. Only by continuing 
to study these fundamental processes may we hope to reap the 
benefits offered to us by these new opportunities. 

Work done in this laboratory was supported hy National Institutes 
of Health grants to P.P. 
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ABSTRACT In late 1994 and early 1995, Ebola (EBO) 
virus dramatically reemerged in Africa, causing human dis- 
ease in the Ivory Coast and Zaire. Analysis of the entire 
glycoprotein genes of these viruses and those of other EBO 
virus subtypes has shown that the virion glycoprotein (130 
kDa) is encoded in two reading frames, which are linked by 
transcriptional editing. This editing results in the addition of 
an extra nontemplated adenosine within a run of seven 
adenosines near the middle of the coding region. The primary 
gene product is a smaller (50-70 kDa), nonstructural, se- 
creted glycoprotein, which is produced in large amounts and 
has an unknown function. Pbytogenetic analysis indicates that 
EBO virus subtypes are genetically diverse and that the recent 
Ivory Coast isolate represents a new (fourth) subtype of EBO 
virus. In contrast, the EBO virus isolate from the 1995 
outbreak in Kikwit, Zaire, is virtually identical to the virus 
that caused a similar epidemic in Yambuku, Zaire, almost 20 
years earlier. This genetic stability may indicate that EBO 
viruses have coevolved with their natural reservoirs and do not 
change appreciably in the wild. 

The 1995 epidemic of Ebola (EBO) virus disease in Zaire ( 1 ) t 
coupled with the discovery of an EBO virus in the Ivory Coast 
in late 1994 (2), has sparked new scientific and public interest 
in these mysterious and highly pathogenic viruses. EBO viruses 
are nonsegmented negative-strand RNA viruses that are ge- 
netically related to, but distinct from, the Marburg (MBG) 
virus (3). These viruses, classified in the genus FUovirus in the 
family tf loviridae, cause a severe hemorrhagic disease in 
human and nonhuman primates (4, 5). The EBO group of 
filoviruses comprises three discrete subtypes: Zaire, Sudan 
and Reston (EBO-Z, EBO-S, and EBO-R, respectively). In 
late November 1994, EBO virus was discovered in the Tai 
Forest of the Ivory Coast (1). This virus (EBO-IC) caused a 
single nonfatal human infection and was believed to have 
caused increased mortality in a troop of wild chimpanzees. 
Preliminary serological analysis of the EBO-IC isolate suggests 
that it may represent a fourth subtype (ref. 2; Centers for 
Disease Control and Prevention, unpublished data). The much 
publicized outbreak of human disease in the city of Kikwit, 
Zaire, in late Spring, 1995, involved over 300 cases with a 77% 
mortality rate (1, 6). The epidemic was caused by a strain of 
EBO-Z, the most pathogenic subtype of EBO virus, which had 
not been reported for 18 years. 

The virion surface glycoproteins (GPs) of filoviruses are 
multimers of a single structural GP; they are important in the 
binding of virions to cell receptors and virus entry into the cell 
cytophsm (3). This protein is expressed from the GP gene, 
which in the genomes of filoviruses is positi ned fourth (from 
the 3' end) of seven linearly arranged genes (3, 7). Filovirus 
GPs are highly glycosylated, containing both N-Iinked and 
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O-hnked carbohydrates (3, 8-10) that contribute from one- 
third to one-half of their relatively large molecular weight ( M r 
* 130-170 kDa). 

To better define the relationship of EBO viruses to one 
another, we have investigated the structure and expression of 
their GP genes, t Here we describe an unusual organization of 
the GP genes of all EBO virus subtypes, present a phylogenetic 
profile for the family Filoviridae, and discuss the role of GP 
gene products in the pathogenesis of EBO virus disease. 

MATERIALS AND METHODS 

Viruses. The initial EBO-R isolate (R/USA/Reston/1989/ 
119810 strain) was obtained from P. B. Jahrling (U.S. Army 
Medical Research Institute for Infectious Diseases, Fort De- 
trick, MD) and was isolated and passaged once in MA- 104 
cells, plaque purified three times, and passaged once in Vero 
E6 cells. TVo 1992 EBO-R viruses [R/Philippines/Manila/ 
1992/920084, isolated at Centers for Disease Control and 
Prevention (Atlanta), and R/Italy/Siena/1 992/1 2552, ob- 
tained from D. Brown (Public Health Laboratory Service, 
Virus Reference Laboratory, London)] were passaged 2 to 3 
times in Vero E6 cells and once in MA-104 cells. A 1976 
EBO-Z virus (Z/Zaire/Yambuku/ 1976/057935 also referred 
to as the Mayinga isolate) was passaged once in Vero cells and 
once in Vero E6 cells. The passage histories of other EBO-Z 
viruses (Z/Zaire/ Yambuku/ 1976/057878 and Z/Zaire/ 
Tandala/1977/088296, also referred to as the Eckron and 
Bonduni isolates, respectively) and EBO-S viruses (S/Sudan/ 
Maridi/1976/VCP2D11 and S/Sudan/Nzara/1979/015176, 
also known as the Boneface and Maleo isolates, respectively) 
are described elsewhere ( 1 1). The EBO-IC (IC/Ivory Coast/ 
Tai Forest/1994) was obtained from B. Le Guenno (Institute 
Pasteur, Paris), and was passaged 2 to 5 times in Vero E6 cells. 
The EBO-Z virus from the 1995 epidemic in Kikwit (Z/Zaire/ 
Kikwit/1995/9510621) was isolated at the Centers for Disease 
Control and Prevention in Vero E6 cells inoculated with a 
blood specimen from an acutely infected patient (who later 
died). However, all sequence data related to this strain were 
derived from RNA extracted from the same human blood 
specimen. 

Viral RNA Purification, Amplification, and Cloning. Prep- 
aration of genomic RNA (vRNA) from purified EBO virions 
and the extraction of total infected cell RNA were performed 
as described (12). Syntheses of cDNA from the vRNA of 
EBO-R (Reston/1989) and EBO-S (Nzara/1979) were per- 
formed (13), followed by blunt-end ligati n into the Sma I site 

Abbreviations: EBO, Ebola; MBG, Marburg; TCF, tissue culture 
LtLST' glycoprotein; SGP, small/secreted glycoprotein precursor; 
RT-PCR; reverse transcription-PCR; PMSF, phenylmethyisulfonyl 
fluoride; ORF, open reading frame; RIP, radioimmunoprecipitation. 
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of pUC18 or pSP73/pSP72 vectors (Pr mega) and cloned as 
previ usly described (14). Sequencing of cl ned plasmid prep- 
arations was perf rmed using the dideoxy chain terminati n 
method (15). A manual radiolabeling method employing 
cloned bacteriophage 17 DNA polymerase (Sequenase) and 
an automated noniso topic method (dye-terminator cycle se- 
quencing; Perkin-Elmer) was used. Direct (automated) se- 
quencing of DNA amplified from viral sequences by reverse 
transcription-PCR (RT-PCR) was used in sequencing the 
EBO-S strains, EBO-IC, the Kikwil/1995 strain of EBO-Z, 
and the 1992 EBO-R isolates. RT-PCR amplification of pu- 
rified vRNA sequences was performed using GenAmp RNA 
PCR kits (Perkin-Elmer) or reactions were assembled using 
reagents and buffers obtained from commercial sources (Pro- 
mega and Boehringer Mannheim). 

Quantitation of Transcriptional Editing. Editing of GP 
gene transcripts was quantitated for EBO-R (Reston/1989) 
and EBO-Z (Yambuku/ 1 976/057935) by RT-PCR amplifica- 
tion of GP sequences from crude infected cell RNA (mRNA) 
preparations. First-strand cDNA synthesis was performed by 
heating approximately 5 fig of total infected cell RNA and 300 
ng of the primer 5'-CCGGTACC(Ths at 65°C for I min and 
placing the mixture at 42°C; the remaining components were 
then added (25-fd reaction volume). Primer extension with 
reverse transcriptase proceeded for 40 min, followed by adding 
100 /il of sterile water and boiling for 3 min. Approximately 
5-10 jil of this diluted reaction solution served as template for 
the amplification of viral sequences. For PCR amplification of 
the EBO-R GP gene editing region, the primers 5'-CCTTAG- 
CAACAGTACAGGGAGAT and 5'<KXGTCTCTTGCGG- 
GTCCTGG were used. Fc: the EBO-Z GP gene, the primers 
5'-CTGGATCCAACAACACAATGGGCGTTACAGGAAT 
and 5 A ATCGTA(XG ACTCGTGGAG ATTGTGG were 
used to amplify the editing region. The EBO-R PCR product was 
cleaved with the restriction enzyme BamHl and iigatcd into the 
BamHX site of pUCl& The EBO-Z PCR product was cleaved 
with Kpn I followed by BamHl, directionally Iigated into pUCl&, 
and isolated clones sequenced 

* To determine if virion (genomic) RNA might contain eight 
or more uridines at the editing site, vRNA was extracted from 
a human blood specimen (from which the Kikwit/1995 virus 
was isolated) and used as template in a RT-PCR to amplify a 
DNA fragment containing the editing site. The primer 5'- 
GGACCCGTCTAGTGGCTACTATTC (plus-sense) was 
used to prime first-strand cDNA synthesis from vRNA tem- 
plate molecules, as above, then the reaction was boiled 3 min 
before its use in PCR amplification. The above primer was then 
used with the negative-sense primer 5 '- CCA ATGCAT G AT- 
GCGACACTGCAGCTTCCC (nonviral sequences are under- 
lined) to amplify a 482-bp fragment containing the editing 
region. The RT-PCR product was isolated by agarose gel 
electrophoresis and extracted from the gel using a commercial 
kit [Q1AEX II, Qiagen Inc. (Chatsworth, CA)]. Isolated DNA 
was directly Iigated into the plasmid pCRII using a TA 
prokaryotic cloning kit (Invitrogen). Cloned plasmid DNA was 
isolated and individual clones were sequenced. 

in Vitro Expression of EBO-R GP Gene Products. In vitro 
expression of EBO-R (Reston/1989) GP gene sequences was 
performed by RT-PCR amplification of the GP open reading 
frame (ORF) from total infected-cell RNA using the primers 
5 ' -ATACCCGGGCCCCA A ATTACCTATACA ACA and 
5'-TTTTCTAGAATATTAATCATTATTAAGAGA. Am- 
plified products were digested with Xma I and Xba I and 
directi nally Iigated int pSP73,cl ned, and full-length inserts 
containing seven r eight adenosines in the editing site were 
isolated (pSP73-RESGP-7A and pSP73-RESGP-8A, respec- 
tively). Plasmids were linearized with //mdlll transcribed with 
SP6 RNA polymerase, DNase-treated, and run-off transcripts 
were in vitro translated using a rabbit reticulocyte tysate system 
(Promega). 
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fedMmmuaop^ (RIP) of EBO Virus GP*. Mono- 

layers f MA-104 ceils in 24-well panels were infected with EBO 
viruses (s!0 plaque-f rming units per ceO) or mock infected. 
Cells were incubated for 3 to 4 days, washed with Dumecco's 
minimal essential medium deficient in methionine and/or cys- 
teine containing 2% dialyzed fetal bovine serum and antibiotics, 
and 300 jd of the same medium plus 20 jiG/nri (^methionine/ 
cysteine. Cells were incubated for 4 hr, then tissue culture fluid 
(TCF) was removed and the cells were washed as iibove. The cdl 
monolayer and 900 jd of harvested TCF were treated with 100 p] 
\% SDS for 1 min, followed by the addition of 900 jd 1.1 x TNE 
buffer (ix TNE = 10 mM Tris-Hd/150 mM Nad/3 mM 
EDTA, pH 7.6) containing 1.1% Nonidet P-40, 055% sodium 
deoxycholaic, and 1.1 mM phenylmcthylsttlfonyl fluoride 
(PMSF). Triton X-100 treatment was performed by the addition 
of 1 x TNE containing 1% Triton X-100 and 1 mM PMSF to cells; 
TCF was made to 1 % with Triton X- 100 and to 1 mM PMSF. RIP 
assays were performed on 500 jd of solutions as deserted ( 14). 
Antibodies used in RIP assays were: (i) an anti-EBO-R GP/ 
small/secreted glycoprotein precursor (SGP) monoclonal anti- 
body (high-titered mouse ascitic fluid; cross-reactive with GP and 
SGP) prepared essentially as described (16), and (tf) a fugh-thered 
polyclonal mouse anti-EBO-Z GP produced against a GP ex- 
pressed by a recombinant baculovirus (10). Endogrycosidase 
digestion of RIP products was performed as described elsewhere 
(17). 

Computer-Aided Sequence Analyses. Computer analyses of 
nucleic acid and predicted amino acid sequences for die GP 
genes of EBO subtypes were performed using the Genetics 
Computer Group (Madison, WI) Sequence Analysis Software 
Package (Version 7J-AXP) run on a Digital AXP Alpha 
Workstation. The pileup program was used to align nucleotide 
sequences with a GapWeight setting of 5.0 and a GapLength- 
Weight setting of 0.5. Phylogenetic analysis of nucleotide 
alignments was performed using the paup software Version 
3. 1 . 1 (developed by D. L. Swofford) run on a Power Macintosh 
model 8100/110. 

RESULTS 

The sequences of the entire GP genes of EBO-Z, EBO-S, 
EBO-R, and EBO-IC were determined from cloned cDNA 
and RT-PCR amplified DNA products (see Fig. 1 legend for 
GenBank accession numbers). The EBO-IC strain was found 
to differ substantially from the other subtypes of EBO virus 
(<60% identity) and represents a fourth subtype. The orga- 
nization and specific features of these genes are shown in Fig. 
1. The most unusual finding was that the GPs of all EBO 
subtypes are encoded in two frames and that in each case an 
SGP was predicted as the primary gene product 

The GP genes of all EBO viruses begin and end with 
conserved transcription start and stop (poryadenylyiation) sites 
(14). The only difference in GP gene organization seen in the 
four subtypes is found in EBO-R. Foi the other subtypes, the 
transcription stop site overlaps the transcription start sites of 
the downstream ( VP30) gene and is centered on the common 
pentanucleotide sequence 3 -UAAUU found in all transcrip- 
tional signals (3 # -CUACUUCUAAlIUCUUUUU). In con- 
trast, the GP gene stop site of EBO-R is separated from the 
downstream gene by a short intergenic region of three nucle- 
otides that is followed by a conserved start site (3 f -TJAAU- 

UCUUUUU-GAA-UACUGCUUCUAAUU). 

The SGP and GP share the sam N-terminal ~300 residues, 
but have unique C-terminal sequences (Fig. 2). Unlike that f 
the GP, the C terminus of the SGP lacks a transmembrane 
anchor sequence, which results in its secretion from infected 
cells. The unique C-terminal portion of the SGP is hydrophOic 
and rich in charged residues (mostly positive). The entire SGP 
sequence for EBO subtypes is relatively conserved, but the 
central area f the GP contains a large variable hydrophilic 
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Fio. 1. Schematic representation of the GP gene organization for the EBO-Z, EBO-R, EBO-S, and EBO-IC subtypes. GcnBank accession 
numbers for EBO virus sequences are U23187 (Yambuku/1976/057935), U28077 (Kikwit/1995), U23152 (Reston/1989), U23416 (Manila/1992; 
ORF only), U23417 (Siena/1992; ORF only), U28134 (Maridi/1976), U23069 (Nzara/1979), and U28006 (Tai Forest/1994). Shown are the 
conserved transcriptional start and stop signals, coding regions, and the site of transcriptional editing. An alignment of the region containing the 
transcriptional editing sites for the four subtypes of EBO virus is also shown (EBO-Z sequence shown = 1012-1036). All sequences arc shown in 
the minus sense. The scale at the bottom represents the length of the EBO-Z OP gene (2408 nucleotides). 



region (Fig. 2) where more than half of the potential N-linked 
grycosylation sites and the putative region for O-linked gryco- 
sylation are located. The rest of the GP sequence is conserved 
and is more hydrophobic. In addition, all the cysteine residues 
in the SGP and GP are conserved (see asterisks in Fig. 2). One 
common feature found in the GPs of all filoviruses is a highly 
conserved immunosuppressive motif (26 residues) in the C- 
terminai third of the GP that has a high degree of homology 
to those found in the envelope glycoproteins of oncogenic 
retroviruses (9, 18). 

Since sequence analysis indicated that the GPs of EBO 
viruses are encoded in two frames, it was important to 
determine the mechanism(s) by which the GP is expressed. 
The original published description of the EBO-Z (Yambuku/ 
1976/057935) GP gene (7) was based on a virus stock that had 
undergone three rounds of plaque purification and was later 
determined to contain a single base insertion (extra uridine 
nucleotide) that joined the GP frames. Because a single base 
insertion led to the connecting of the frames in the mutant 
(plaque purified) EBO-Z, it was thought that RNA editing at 
or near this site, in a manner similar to that described for the 
P genes of paramyxoviruses (19), may be responsible for the 
generation of GP. This region of the EBO-Z (Yambuki/1976/ 
057935) and EBO-R (Reston/1989) GP mRNA were RT-PCR 
amplified and cloned, and the isolated clones were sequenced. 
From a total of 81 EBO-R and 41 EBO-Z clones, an addition 
of a single nontemplated adenosine in the mRNA at the 
suspected site was found in 13 (16%) and 11 (27%) of the 
clones, respectively. Fig. 1 shows a short alignment of the GP 
gene region containing the editing site for the four subtypes of 
EBO virus. To eliminate the possibility that the insertion of an 
extra adenosine merely reflects template (vRNA) variability, 
RT-PCR was directed at negative sense EBO-Z (Kikwit/1995) 
RNA isolated from a human blood sample. Sequencing of 32 
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clones showed no insertions in the template vRNA at the 
editing site, indicating that the genomic sequence is homoge- 
neous and transcriptional editing is responsible for the heter- 
ogeneity seen in the GP mRNA clones. 

To confirm that the synthesis of SGP and GP are directed 
by unedited and edited transcripts, respectively, in vitro ex- 
pression of the EBO-R (Reston/1989) GP ORF was per- 
formed. Two plasmids were used to generate run-off tran- 
scripts, one that contained seven adenosines ?.t the editing site 
(mRNA sense) and one that had eight adenosines (GP frames 
connected). Products of in vitro translation of run-off tran- 
scripts are seen in Fig. 14. Translation of the RNA containing 
seven adenosines (unedited form) resulted in the synthesis of 
an unglycosylated protein that is close to the predicted size of 
the SGP backbone (41.8 kDa) (Fig. 14, lane 1). This prot in 
was processed to a higher molecular weight when the transcript 
was translated in the presence of canine pancreatic microsomal 
membranes (lane 2). Similarly, the RNA with eight adenosines 
(edited form) produced a prominent protein in the predicted 
size range for the unglycosylated GP and could also be 
processed to a higher molecular weight form (lanes 3 and 4). 
Translation of the transcript with seven adenosines produced 
not only SGP, but also a small amount of unglycosylated GP 
that was processed in the presence of membranes. Synthesis of 
GP from the unedited transcript may have occurred via 
translational frameshifting in a -1 direction (20, 21). or may 
have occurred due to addition of an extra nucleotide at the 
editing site (or some other site) by the SP6 RNA polymerase, 
similar to the transcriptional editing carried out by the EBO 
virus polymerase. The processed forms of SGP and GP 
expressed in vitro correspond in size to SGP and GP immu- 
noprecipitated from the cells and TCF of EBOR-infected 
MA-104 cultures (Fig. 14, lanes 5-8). The SGP produced in 
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Fio. 2. Profiles of SGP and GP amino acid sequences of EBO subtypes. SGP and GP share the same N-terminal *300 residues. Identified on 
the schematic drawing are the signal sequences for the SGP and GP. the GP C-terminal transmembrane anchor sequence, an ' an immunosuppressive 
motif found in the GP. Conserved cysteine residues are identified by asterisks. A highly variable central region of the ( . is identified, while all 
other regions are relatively conserved. The lengths of the SGP and GP for the EBO-Z subtype are 364 and 676 amino acids, respectively. 
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Fig. 3. Expression of EBO virus GPs. (A) Fluorogram of [^SJmethionine/cystcine-labcled EBO-R GPs expressed in vitro and in vivo separated 
on SDS/polyacrylamide gels (10% gel). In vitro translation products of run-off transcripts generated from the plasmid pSP73-RESGP-7A (tones 
1 and 2) and pSP73-RESGP-8A (lanes 3 and 4), each pair without and with addition of canine pancreas membranes, respectively. Asterisks next 
to lanes 1-4 identify the unprocessed and larger processed (glycosylated) forms of SGP and GP. The remaining tones contain RIP products (bound 
by an anti-EBO-R GP/SGP monoclonal antibody) from EBO-R (Reston/1989)-iiifectcd MA-104 cell lysates (tones 5 and 6), supernatant fluids 
(lanes 7 and 8), or mock-infected cell supernatant fluids (lanes 9 and 10). Lanes 5, 7, and 9 contain products derived from lysates prepared by SDS 
treatment (followed by Nonidet P-40 and deoxycholate), and lanes 6, 8, and 10 were treated with Triton X-100 (13). At the left edge are marked 
the positions where GP and the broader SGP bands ungrate. Asterisks next to lane 8 identify SGP and GP immunoprccipttated from supernatant 
fluids. (B) Endoglycosidase treatment of the EBO-R GP and SGP. Lane 1 (marker lane) contains the same in vitro translation products as lane 
1 in A The same preparation separated in lane 8 in A was mock-treated (lane 2) or digested with either endoglycosidase H (lane 3) or endogjycosidase 
F/N-gtycosidase F mixture (lane 4). Asterisks identify digestion products. {€) Iramunoprecipitation of SGP and GP from EBO virus-infected cell 
supernatant fluids. Lane 1 (marker lane) contains EBO-Z (Yambuku/1976/057935) virion proteins. The remaining tones are paired (2 and 3, 4 
and 5, etc.) and contain mock-treated (even lanes) and endoglycosidase F/N-glycosidase F-treated (odd lanes) proteins mimunopredpitated with 
a mouse polyclonal anti-EBO-Z GP. RIP products were derived from the supernatant fluids of MA-104 cells infected with three different EBO-Z 
(lanes 2-7; viruses Yambuku/1976/057935 and 057878, and Tandala/1977, respectively), two EBO-S (tones 8-11; viruses Nzara/1979 and 
Maridi/1976), and EBO-R (lanes 12 and 13; Reston/1989). Asterisks identify three prominent GP bands and dcglycosylated products seen in EBO-Z 
lanes. (£>) Demonstration that the SGP of EBO-R is a nonstructural GP. Purified virion preparations, labeled with either f ^(methionine (lane 
1) or PHJglucosamine (lane 2), are shown together with the same RIP products as lane 8 in A Asterisks identify the virion GP in tones 2, and 
the GP and SGP in tone 3. At the left edge are marked the locations of the virion GP, nucfeoprotein (NP), and VP40 (matrix protein). 



vitro, however, did not appear as broad (heterogeneous) as the 
SGP detected in the TCF of EBO-R-infected cultures. 

The secreted EBO-R SGP ranges in size from 50 to 70 kDa, 
with the central major band estimated to be 59 kDa. Endogly- 
cosidase digestion indicated that at least three major species 
are present (Fig. 3B). Removal of all N-Iinked glycans by Endo 
F/NF produced two bands, a smaller predominant species and 
a weaker larger band that may contain O-linked glycans or 
represents some other form of the SGP. The SGP found in the 
supernatant fluids of cells infected with EBO-Z and EBO-S 
are comparable with those seen with EBO-R (Fig. 3C). The 
EBO-Z strains showed strong bands that correspond to the GP 
and SGP described above for EBO-R, but cross-reactivity of 
the anti-EBO-Z GP with the EBO-S and EBO-R subtypes 
resulted in much weaker and thinner bands. Surprisingly, the 
Zaire subtypes produced a third glycosylated protein band 
(**24 kDa) of unknown origin. Fig. 3D shows that the SGP of 
EBO-R is absent from purified viri n preparations (as is the 
case f r the ther EBO subtypes) and is thus a nonstructural 
protein. As with the SGP, the smallest GP n ted above has not 
been detected in viri ns. 

The EBO virus GP genes differed from those of MBG virus 
by at least 55% at the nude tide level and 67% at the amin 
acid level. Even am ng the EBO subtypes a high degree of 
genetic variability was evident. F ur clearly distinct EBO virus 
subtypes (Zaire, Sudan, Reston, and Ivory Coast) were iden- 



tified, differing from one another by 37-41% and 34-43% at 
the nucleotide and amino acid level, respectively. A significant 
finding was that the nucleotide sequence of the Kikwit/1995 
EBO-Z differed from the Yambuki/ 1976/057935 isolate by 
<1.6% over the entire gene. In addition, no amino acid 
sequence differences are predicted for the SGPs of these 
viruses, but a 2% difference in the GP was noted, primarily in 
the variable region where 11 of the 13 amino acid changes 
occurred. 

A detailed phylogenetic profile of the family Filoviridae was 
determined using nucleotide sequences that encode N- and 
C-terminal regions of the GP. These regions contained suffi- 
cient homology to allow accurate nucleotide alignment Max- 
imum parsimony analysis of these sequences produced a single 
most parsimonious tree that clearly separated EBO and MBG 
viruses and divided the EBO subtypes into four distinct dades 
(Fig. 4). 

DISCUSSION 

The finding that all EBO subtypes encode their structural 
(viri n) GPs in two frames and that expression f these GPs 
occurs through transcripti nal editing is highly unusual. Our 
results indicated that there was n sequence variation in the 
EBO vRNA c rresponding t the mRNA editing site when 
analyzed directly from the blood f an acutely infected human. 
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Fig. 4. Phylogenetic relationship of filovirus GP gene sequences. 
Shown is a phylogenetic tree obtained from an alignment of concat- 
enated nucleotide sequences that encode conserved N-terminaJ and 
C-terminal regions of filovirus GP ORFs. The ORF sequences for 
EBO-Z, EBO-IC, and EBOS are nucleotides 46-536 + 1528-1885. 
for EBO-R 49-539 + 1531-1888, and for MBG 2-489 + 1532-1589 
(1 = start of ORF). Maximum parsimony analysis (branch-and-bound 
with a 4:1 weighting of transversions to transitions) produced a single 
most parsimonious tree. The values for bootstrap confidence limits 
(%) for branch points were generated from 1000 replicates of the 
analysis and are indicated in parentheses at branch points. GenBank 
accession numbers for sequences used in phylogenetic analyses are 
U28077, U23187, U28006, U23069. U28134, U234I6. U23417, 
U23152. ZI2132. and Z29337. 

This reinforces our belief that the encoding of the GP in two 
frames maybe important in the maintenance of EBO in nature 
and perhaps also in the pathogenesis of human disease. We 
have no data bearing on the consequences of the circulation of 
a soluble protein bearing the N-terminal 300 amino acids of the 
virion GP, and we have not found homologues of the novel 
SGP sequences (C-terminal 70-77 residues) that suggest a 
specific role in the infectious process. We speculate that the 
SGP may interact with the immune system, either at the 
cellular level, where the result could be cellular deletion, 
anergy, or activation of suppression, or at the humoral effector 
level, where high-affinity antibodies directed against the N- 
terminal portion of the GP may be prevented from acting on 
virions or cells presenting surface GP. Indeed, patients usually 
die without evidence of an effective immune response* and 
even when recovery is underway survivors do not have detect- 
able virus neutralizing antibodies. Our preliminary investiga- 
tions have shown that SGP is detectable in high concentrations 
in the blood f acutely infected patients (data n t sh wn). If 
the SGr does play an important role in the pathogenesis of 
human infection, it may be possible t protect or increase 
survival rates if it can be targeted by specific antibodies early 
in the infecti n. 

The discovery of a third and even smaller 24-kDa glycopro- 
tein in the medium of EBO-Z-infected cells was unexpected. 
This protein, which is reactive with an anti-EBO-Z GP serum, 



may result from cleavage or degradati n fGP or SGP or may 
be produced as a result of a premature terminati n event 
during translation. Further studies are required in order to 
determine h w this protein is produced in infected cells and if 
this protein is unique to certain EBO subtypes. 

The immunosuppressive motif identified in the C-terminal 
third of filovirus GPs is the most conserved sequence se n 
between the GPs of EBO and MBG viruses (7), and may 
contribute to the immunosuppression observed in humans and 
monkeys infected with filoviruses. In addition, this motif may 
have a role in the assembly of the GP into peplomers. 
Immediately upstream and slightly overlapping the N-terminal 
end of the immunosuppressive motifs of all filovirus GPs is a 
region that contains a heptad repeat sequence that Chambers 
et aL (22) identified in the fusion GPs of oncogenic retrovi- 
ruses (the same location with respect to the immunosuppres- 
sive motif of pl5E) and paramyxoviruses. They postulated that 
these repeats may form an extended backbone for the spike 
structures through the coiling of the GPs around one another 
at these regions. 

Phylogenetic analysis of GP gene nucleotide sequences 
clearly distinguishes the EBO subtypes from MBG virus, but 
also indicates that each of the four EBO subtypes represents 
a mon^ohyletic lineage. The close phylogenetic relationship of 
EBO-R strains, filoviruses associated with monkeys exported 
from the Philippines (23-25), to the African subtypes may 
indicate that this EBO-R is not indigenous to Asia but may 
have been introduced to this region from Africa. 

The similarity in the GP genes of the 1976 »nd 1995 EBO-Z 
strains (<1.6% for the entire gene) is surprising, since more 
than 18 years and 1000 km separate the outbreaks caused by 
these viruses. These data may indicate that EBO viruses (and 
filoviruses in general) have evolved to occupy relatively stable 
specific niches in nature and may not change appreciably in 
their natural hosts. The high degree of similarity between the 
1976 and 1995 EBO-Z viruses suggests that the reservoir is the 
same in both locations and that it is widespread in Zaire or is 
a migratory species. In contrast, the divergence seen among 
the different EBO subtypes and between EBO and MBG 
viruses might also imply that these viruses are very old and 
have slowly coevolved with their as-yet-unknown natural hosts, 
which may also demonstrate a similar extent of genetic vari- 
ability. While the genetic diversity among the four EBO virus 
subtypes is high, the apparent stability of each subtype bodes 
well for the development of effective immune or antiviral 
therapy or vaccination strategies. A large scale effort is cur- 
rently underway to attempt to discover the natural reservoir 
for EBO viruses. This includes screening of large numbers of 
vertebrates and invertebrates from the Kikwit area for evi- 
dence of EBO virus infection. Currently, we have insufficient 
knowledge to help narrow the focus of this search, although 
considerable effort is being placed on vertebrates in the 
collection. This decision is influenced by the genetic stasis 
observed between the EBO-Z strains from 1976 and 1995, and 
by the fact that all filoviruses examined possess an immuno- 
suppressive domain in their GP that to date has only been 
found in vertebrate RNA viruses. 

In conclusion, we have described an unusual type of orga- 
nization for the GP genes of all EBO viruses, which has not 
been described for any other type of virus gene encoding a 
structural GP. We feel that the expression of this gene is an 
important element in the pathogenesis of EBO viruses and is 
critical t their maintenance in the natural host. In the future 
we plan toe ntinue studying the expressi n of EBO virus GPs 
in vitro and in vivo, t examine the roles f SGP and GP in the 
disease process, and to determine if these proteins can serve 
as protective immunogens. 
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